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Abstract — The capacity region of a broadcast channel consist- 
ing of fc-receivers that lie in a less noisy sequence is an open 
problem, when k > 3. We solve this problem for the case k = 3. 
Generalizing this result, we prove that superposition coding is 
optimal for a class of broadcast channels with a sequence of less 
noisy receivers. The key idea is a new information inequality 
for less noisy receivers, which may be potentially useful in other 
problems as well. 

I. Introduction 

Consider the problem of reliable communication of k in- 
dependent messages M±,...,Mk over a discrete-memory less 
broadcast channel (DM-BC), to fc-receivers Yi, Y 2 , . . . , Yjt 
respectively. A (2 nRl x • ■ • x 2 nRk , n) code for the DM-BC 
consists of: (i) a message set [1 : 2 nRl ] x • • ■ x [1 : 2 nRk ], (ii) 
an encoder that assigns a codeword x n (mx, . . . , m^) to each 
message-tuple (mi, . . . , m^), and (iii) k decoders, decoder I 
assigns an estimate mi(yV' 1 ) £ [1 : 2 nRl ] or an error message 
e to each received sequence yf x , 1 < I < k. We assume 
that the messages are uniformly distributed over is uniformly 
distributed over [1 : 2 nR ] x • • • [1 : 2 nRk }. The probability of 
error is defined as P { e n) = P{uf =1 M; ^ M}. 

A rate-tuple (i?i, ■ ■ ■ ,Rk) is said to be achievable if there 
exists a sequence of (2 nRl x • • • x 2 nRk , n) codes with Pe — > 
as n — s> 00. The capacity region is defined as the closure of 
the union of all achievable rates. 

Definition 1: A receiver Y s is said to be less noisyQ than 
receiver Y t if I(U; Y s ) > I(U; Y t ) for all U -> X -> (Y s ,Y t ). 

We denote this relationship(partial-order) by Y s >z Y t . 

Remark 1: Observe that this partial-order only depends on 
the marginal distributions p{y s \x) and p(yt\x). 

Definition 2: A fc-receiver less noisy broadcast channel is 
a fc-receiver discrete memoryless broadcast channel where the 
receivers satisfy the partial order Yx y Y2 >r • • • t Yk- 

The capacity region for the 2-receiver broadcast channel 
was established (Proposition 3 in J3|) to be the union of rate 
pairs (Ri,R 2 ) satisfying 

Ri <I(X;Y X \TT) 
R2 < I{U; Y 2 ) 

over all choices of (U, X) such that U — > X — >• (Yi , Y2) forms 
a Markov chain. 



The extension of this result to fc-receivers is open, k > 3. 
In this paper we present a simple proof for the case fc = 3. 

Further our proof can also be used to provide an alternate 
(much-simpler) proof for fc = 2, although it must be noted 
that the original proof provides a strong-converse while ours 
provides a weak-converse. A modern-day weak converse proof 
for the 2-receiver case may also be obtained using the outer 
bounds in J4), J2), J6], however each of these uses Csiszar sum 
lemma which has no natural generalization to three receivers. 
Instead our proof relies on a information inequality (Lemma 
[TJ valid for less noisy-receivers which helps us by-pass the 
use of Csiszar sum lemma. 

Indeed using this lemma one can also obtain the capacity 
region for a subset of fc-receiver less noisy broadcast channel 
(which contains the 3-receiver less noisy broadcast channel as 
well). However for clarity of exposition, we shall first establish 
the result for the 3-receiver less noisy broadcast channel and 
then present the general result for the class of fc-receiver less 
noisy broadcast channel. 

II. Three-receiver less noisy broadcast channel 

The main result of the paper is the following: 

Theorem 1: The capacity region of a 3-receiver less noisy 

discrete memoryless broadcast channel is given by the union 

of rate triples R 2 , R3) satisfying: 

Ri < I(X;Y!\V) 
R2 <I(V;Y 2 \U) 
R 3 <I(U;Y 3 ) 

over all choices of (U,V,X) such that U -> V — > X -> 
(Yi, 5*2) ¥3) forms a Markov chain. Further it suffices to 
consider \U\ < \X\ + 1, \V\ < (\X\ + l) 2 . 

A. Achievability 

The rate-triples are achievable using superposition coding 
and jointly typical decoding. The arguments are standard in 
literature and hence only a minor outline is provided. 

Consider a (U,V,X) such that U -> V ->■ X ->■ 
(Yi, Y2, Y3) forms a Markov chain. We will show the achiev- 
ability of any rate-triple satisfying R3 < I(U;Yz), R 2 < 
I{V-Y 2 \U),R 1 </(X;Y 1 |y). 



The encoding proceeds as follows: 

• Generate 2 Iii?3 sequence u n (m 3 ) ~ Y[7=iPu{ u i)- 

• For each m 3 , generate 2 nR2 sequences v n (m 2 ,m 3 ) dis- 
tributed according to Y[i=iPv\u( v i\ u i)- 

• Finally for each (7712,013) pair, generate 2 nRl 
x n (mi, 7712,7713) sequences distributed according to 

nr=iPxiv,tf(a*K)«*) = nr=iPxiv(^i^)- 

Receiver Y3, upon receiving y^, assigns M 3 = 7773 if there 
is a unique sequence u n (m 3 ) such that the pair (u n (m 3 ), y^) 
is jointly typical; otherwise receiver Y 3 declares an error. 
This decoding succeeds with high probability as long as 
R 3 <I{U;Y 3 ). 

Receiver Y2 performs successive decoding. (This is in gen- 
eral worse than joint decoding, but in this situation successive 
decoding is enough.) Upon receiving y 21 , assigns M 3 = 777,3 
if there is a unique sequence u n {m 3 ) such that the pair 
(u n (7773), 2/21) is jointly typical; otherwise receiver Y 2 declares 
an error. Assuming if finds a unique 77,™ (777,3) sequence, it then 
assigns M 2 = m 2 if there is a unique sequence v n {m 2 ,m 3 ) 
such that the triple (it" (777,3 ), u " (mj, 777,3), 2/21) is jointly typ- 
ical; otherwise receiver Y2 declares an error. The first step 
of decoding succeeds with high probability as long as R 3 < 
I(U;Y 2 ), but this holds as I(U;Y 2 ) > I(U;Y 3 ) (since Y 2 is 
a less-noisy receiver than Y 3 ). The second step of decoding 
succeeds with high probability as long as R 2 < I(V;Y 2 \U). 

Similarly, receiver Y\ also performs successive decoding. 
The three steps of decoding will succeed with high probability 
as long as the conditions R 3 < I(U;Y 1 ),R 2 < I(V;Yi\U), 
and Rt < I(X;Y 1 \V,U) = I{X;Y X \V) hold. Since Y x t 
Y 2 h Y 3 the first two conditions are automatically satisfied. 
This completes the proof of achievability. ■ 

B. Converse 

The interesting part of this proof is the converse, and in 
particular the use of Lemma[T]to identify the auxiliary random 
variables. 

Lemma 1: Let Y s >z Y t , and M be any random variable 
such that 

M^X n -> {Y s y,Y t ^) 

form a Markov chain. Then the following hold: 

1) IiY^Yt^M) > itf^YtdM), 1 < z < 77. 

2) itf^Y.dM) > I{Y}^ l ;Y Stj \M), 1 < 1 < 77. 
Proof: The proof of Part 1 follows by progressively 

flipping one co-ordinate of Yjj - to Y£[~ , and showing that 
the inequality holds at each flip using the less-noisy (Y s y Yt) 
assumption. 

Observe that for any 1 < r < i — 1 

IiY^^-YulM) 

= IiYl^X^YAM) + I(Y s y, Y tii \M, Yf^X-lJ 

> KYl^X^Y^M) + I(Y t>r ; Y U \M, Y t ^\ Y^) 

= i(y^,F;-| i; y ti |M) ) 

where (a) follows from the following two observations: 



• (M, YJT{~ 1 , 7 Yti ) X r -> (y s , r ,F t , r ) forms a 
Markov chain 

• The receiver Y s is less noisy than Y t implying, in partic- 
ular, that 

/(F a>r ;y t , i |M > 3^f 1 ,y^ 1 ) > /(y t>r ;y t , i |M > 3^f 1 ,^ 1 ). 
This yields us a chain of inequalities of the form 

i(i£ 1 ; y M |M) > /(y^y^y^M) > ... 
• •• > 7(y t ^ 2 ,y M -i;yti|M) > /(y^y^M), 

thus establishing the Part 1 of the Lemma. 

The proof of Part 2 follows identical arguments (replace Y t i 
by Y s i) as in the proof of Part 1 and is omitted. ■ 

The main converse follows using Fano's inequality and the 
above lemma. 

Observe that 

nR 3 ^HM^Y^+nen 

n 

= Y / HM 3 ;Y 3 , l \Y*- 1 1 ) + ne n 

i=l 

n 

<^J(M 3 ,y 3 i i I 1 ;y 3 ^+ne„ 

i=l 
(a) n 

< ^/(M3,y 2 ^ 1 ;y 3) i)+ne„ 

i=l 
n 

= ^ J(f7i; y 3li ) +ne„, 

i=l 

where Ui = {M 3l Y 2 l \ l ). Here (a) follows from LemmaQ] 
From Fano's inequality we also have 

nR 2 <I(M 2 ;Y^\M 3 ) + ne n 

n 

= I(M 2 ;Y 2 ,i\M 3 , Y^ 1 ) + ne n 
i=i 

n 

= Y,I(V i ;Y 2!i \U i )+ne n , 

i=l 

where V t = (A/ 2 , M 3 , K^ 1 ). 
Finally observe that 

nRi < /(Mxjl^ilMa, M 3 ) + ne n 

n 

= ^ /(M^y^lMa, MgF^I 1 ) + 77£„ 

1=1 

(a) n 

< T ( X *> Y ^\ M 2, M 3 , Y^ 1 ) + ne n 

i=l 

n 

( = } J2 HXf, Y 1;i \M 2 , Af 3 ) - /(y^I 1 ; y M |M 2 , M 3 ) + 77e„ 
1=1 

(c) " 

< I(X i; Y hl \M 2} M 3 ) - /(y^I 1 ; y M |M 2 , M 3 ) + ne„ 

i=l 



< H x u Y hi \M 2 , M 3 , Y^ 1 ) + e„ 

i=l 
n 

= ^2l(X i ;Y 1 , i \V i ) + ne n . 

i=l 

Here (a), (6), and (d) follow from the data processing inequal- 
ity and that 

(M 1: M 2 ,M 3 , Yi'I 1 , K, 1 .! 1 ) -V X, -> F u 

forms a Markov chain. The inequality (c) follows from Part 
2 of Lemma Q] 

let Q E {1)2, ...,n} to be a uniformly distributed random 
variable independent of all other random variables. Setting 
U = (Uq, Q),V = (Vq, Q), X = Xq completes the converse 
in the standard way. Clearly U — > V — > X forms a Markov 
chain as Vi = (Ui,M 2 ). 

The cardinality arguments are standard in literature (see |fl~), 
J5]), and follows using the Fenchel-Eggleston strengthening of 
the usual Caratheodory's argument. 

This completes the proof of the converse. ■ 

A natural question here is whether this approach generalizes 
to more than three receivers. It appears to the authors that to 
generalize this argument to more than three receivers, one has 
to impose additional constraints on the class of A: -receiver less 
broadcast noisy channels. Since this generalization leads to a 
rather interesting condition we shall define the class, and give 
a brief outline as to why the proof generalizes naturally under 
this setting. 

III. THE fc-RECEIVER INTERLEAVABLE BROADCAST 
CHANNEL 

Definition 3: A fc-receiver less noisy broadcast channel is 
said to belong to be an interleavable broadcast channel if there 
exists k — 1 virtual receivers V\, V k -\ satisfying: 

• X —tVi — > ... —tVk-i forms a Markov chain and 

• The following"interleaved" less noisy condition holds: 

YihVihYih--- *fc-i t Vk-i h Y k . (1) 

This class generalizes the 3-receiver less noisy broadcast 
channel. Indeed, the following broadcast channels are some 
examples belonging to this class : 

1) A sequence of degraded receivers, i.e. X — > Yi — > ... — > 
Y k ; set Vi = Y i+1 , 

2) A sequence of "nested" less noisy receivers, i.e. Y s >z 
(Y s+1 ,...,Y k );setV i = (Y i+1 ,...,Y k ), 

3) A 3-receiver less noisy sequence, i.e. Y± >z Y 2 > Y 3 \ set 
V 1 = V 2 = Y 2 . 

From Remark [T] we know that the less-noisy ordering only 
depends on the marginals. Hence without loss of generality 
we can assume that the probability distribution factorizes as 



follows: 

/ n n n n n \ 

n 

= \\p{xi\x l ~ 1 )p{yii, .., y kl ,v u , .., Vk-i,i\xi) 

i=l 
n 

= _Qp(^ik 8_1 )p(yii! ■•: Vki\Xi)p(vii, .., V k -l,i\xi) 
i=l 

n k — 1 

= Y\_p{x l \x l ~ 1 )p{yii,..,y kl \x l )p(v li \x i ) JJp(«ji|« 3 --i,i) 

«=1 J=2 

Here the first equality is due to the fact that the channel 
is DMC without feedback, second is due to the fact that the 
assumptions on the less noisy structure just depends on the 
marginals, and third is due to the Markov chain X — > V\ — > 
... -> Vfc_i. 

Given this structure we immediately observe the following 
Markov chain 

Vij 1 V^l, -> X n , Y?, . . . , Y k n , M Xl M k . (2) 

for 1 < s < k - 1; (set V = X). 

Theorem 2: The capacity region of a k-receiver interleav- 
able less-noisy discrete memoryless broadcast channel is given 
by the union of rate triples [R\, . . . ,R k ) satisfying 

Ri <I{U f , Yi\Ui+i), l<l<k, 

over all choices of (U 2 , ...U k , X) such that (U k +i = 0) — > 
U k -»■ ■••C/ 2 ->■ (E/i = X) -> (Yi,Y 2 , . . . , Y k ) forms 
a Markov chain. Further it suffices to consider |C/fe_ r | < 
(\X\ + l) r+1 , 1 < r < k - 2. 

Proof: The proof is almost identical to that of the three 
receiver broadcast channel. The achievability proof is standard 
using superposition encoding and successive decoding and is 
omitted here. 

Let Mi +1 = (Mj+i, M k ). Using Fano's inequality, 
observe that for 1 < I < k. 

nRi < I (Mi; i^ilMj+x) + ne n 

n 

= Y,HMi;Y l , i \MlX 1 ,Yfc 1 )+ne n 

i=l 
n 

= Y,I(M l ,Yfc 1 ;Y l , i \MlX 1 ) 

i=l 

-itf^Y^Mf^ + en 

< I(M h Y^;Y lti \M? +1 ) 
-/(^'r^MlAtf+O + en 

< ^MuV^MM^) 
-I^^Y^M^ + en 

® I(M l ,v£l 1 iY,, i \Mj° +1 ,V*J 1 )e n 

n 

= }^I(Ui,i;Yi t i\Ui +lt i) + ne n , 
i=i 



where U hi = (M, fe , V^)- We set V = X. Here the 
inequalities (a), (b) follow from the Lemma[T]and that Vi-i >z 
Yi >: V s -i. The equality (c) follows from the Markov chain 
in ©. 

Define Q to be a uniform random variable taking values 
in {1, .., n} and independent of all other random variables. As 
usual, we set Ui = (U^q, Q) and X = Xq. Since X — > V\ —> 
■ ■ ■ Vk-i is a Markov chain it follows that Uk — > Uk-i —> 
•••—)■ U 2 — >■ X forms a Markov chain as well. The cardinality 
arguments are again standard and omitted. This completes the 
proof of the converse. ■ 

Remark 2: It is not very difficult to observe that in general 
the 4-receiver less noisy broadcast channel is not an interleav- 
able broadcast channel. To observe this let Z\ >z Z2 be any 
pair of less noisy but not degraded (stochastically) receivers. 
(Such a pair exists, see or Q). Now let Y X ,Y 2 ~ Z x 
thus sandwiching V\ = Z\ and l^j, Y4 « Z2 thus sandwiching 
V3 = Z2- However X — > V\ — > V3 cannot be a Markov 
chain by the assumption on Z\^2- Hence the problem of 
determining the capacity of fc-receiver less noisy channel 
k > 4 is still very much open. 

IV. Conclusion 

We establish the capacity region for the 3-receiver less noisy 
broadcast channel. We also compute the capacity region for 
a class of k-receiver less noisy sequences that contain the 
previously mentioned scenario. A new information inequality 
is used to obtain the converse, and this technique also sim- 
plifies the original proof [3 l| of the converse of the 2-receiver 
broadcast channel. 
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